A study of reinforcement learning with knowledge sharing for distributed autonomous system

Authors

Kazuyuki Ito

Akio Gofuku

Yoshiaki Imoto

Mitsuo Takeshita

Abstract

Reinforcement learning is an effective control approach for autonomous robots because it requires no a priori knowledge: the behaviors needed to complete a given task are obtained automatically through repeated trial and error. However, complex tasks require a large number of trials, so the tasks that can be learned on a real robot are restricted to simple ones. For this reason, various methods that reduce the learning cost of reinforcement learning have been proposed. Methods that use a priori knowledge lose autonomy, which is the most important feature of reinforcement learning when it is applied to robots. Dyna-Q, a simple and effective reinforcement learning architecture that integrates online planning, learns a model of the environment from real experience and uses that model for learning, which shortens learning time. This architecture preserves autonomy, but the model depends on the task, so the acquired knowledge of the environment cannot be reused for other tasks. In the real world, human beings learn various behaviors for complex tasks without a priori knowledge of those tasks. We can rehearse a task in our imagination without moving our bodies, and after this mental training we save learning time when we try the task in the real environment. This means that we hold a model of the environment and use it to learn. We consider the construction and use of an environment model to be the key ability that makes learning faster. In this paper, we propose a method for obtaining an environment model that is independent of the task, and we reduce learning time by using this model. We consider distributed autonomous agents and show that the environment model is constructed quickly by sharing the experience of each agent, even when each agent has its own independent task.
To demonstrate the effectiveness of the proposed method, we applied it to Q-learning and carried out simulations of a puddle world. As a result, effective behaviors were obtained quickly.

0-7803-7866-0/03/$17.00 ©2003 IEEE. Mitsuo Takeshita, Dept. of Systems Engineering, Faculty of Engineering, Okayama University
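The idea summarized in the abstract can be illustrated with a small sketch. This is not the authors' implementation: it assumes a deterministic 5x5 grid without puddles, a reward that each agent computes from its own goal cell, and hypothetical names (`SharedModel`, `Agent`, `step`). It shows only the general pattern of Dyna-Q-style planning in which the transition model is task-independent and shared, while each agent keeps its own Q-table and task.

```python
import random
from collections import defaultdict

SIZE = 5                                      # assumed grid size
ACTIONS = [(0, 1), (0, -1), (1, 0), (-1, 0)]  # four unit moves

def step(state, action):
    """Deterministic dynamics; a move off the grid leaves the state unchanged."""
    x, y = state[0] + action[0], state[1] + action[1]
    return (x, y) if 0 <= x < SIZE and 0 <= y < SIZE else state

class SharedModel:
    """Task-independent transition memory written to by every agent."""
    def __init__(self):
        self.transitions = {}                 # (state, action) -> next state

    def record(self, s, a, s2):
        self.transitions[(s, a)] = s2

    def sample(self, rng):
        (s, a), s2 = rng.choice(list(self.transitions.items()))
        return s, a, s2

class Agent:
    """Q-learning agent with its own goal that plans on the shared model."""
    def __init__(self, goal, model, rng, alpha=0.5, gamma=0.95, epsilon=0.1):
        self.goal, self.model, self.rng = goal, model, rng
        self.alpha, self.gamma, self.epsilon = alpha, gamma, epsilon
        self.q = defaultdict(float)

    def reward(self, s2):
        # Assumption: the task-specific reward depends only on the next state.
        return 1.0 if s2 == self.goal else 0.0

    def update(self, s, a, s2):
        r = self.reward(s2)
        if s2 == self.goal:                   # the goal is terminal
            target = r
        else:
            target = r + self.gamma * max(self.q[(s2, b)] for b in ACTIONS)
        self.q[(s, a)] += self.alpha * (target - self.q[(s, a)])

    def act(self, s):
        if self.rng.random() < self.epsilon:
            return self.rng.choice(ACTIONS)
        best = max(self.q[(s, a)] for a in ACTIONS)
        return self.rng.choice([a for a in ACTIONS if self.q[(s, a)] == best])

    def run_episode(self, start, planning_steps, max_steps=200):
        s = start
        for t in range(max_steps):
            a = self.act(s)
            s2 = step(s, a)
            self.model.record(s, a, s2)       # real experience, shared with others
            self.update(s, a, s2)
            for _ in range(planning_steps):   # simulated experience from the model
                self.update(*self.model.sample(self.rng))
            s = s2
            if s == self.goal:
                return t + 1
        return max_steps
```

Because rewards are computed by each agent rather than stored in the model, a transition recorded by one agent can be replayed by another agent that has a different goal. How the original method represents rewards and the environment model is not stated in the abstract.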


Similar resources

An Online Q-learning Based Multi-Agent LFC for a Multi-Area Multi-Source Power System Including Distributed Energy Resources

This paper presents an online two-stage Q-learning based multi-agent (MA) controller for load frequency control (LFC) in an interconnected multi-area multi-source power system integrated with distributed energy resources (DERs). The proposed control strategy consists of two stages. The first stage employs a PID controller whose parameters are designed using sine cosine optimization (SCO...


A Distributed Control Architecture for Autonomous Operation of a Hybrid AC/DC Microgrid System

Hybrid AC/DC microgrids facilitate the connection of DC power to the conventional AC power system through the development of distributed generation (DG) technologies. Hybrid systems make the conversion between AC and DC electrical power more convenient. In this paper, an energy management system (EMS) for a hybrid microgrid network is proposed due to the optimal utilization of...


An Unsupervised Learning Method for an Attacker Agent in Robot Soccer Competitions Based on the Kohonen Neural Network

The RoboCup competition, as a great test-bed, has become a popular domain worldwide in recent years. The main objective of such competitions is to deal with the complex behavior of systems which consist of multiple autonomous agents. The rich experience of human soccer players can be used as a valuable reference for a robot soccer player. However, because of the differences between real and simulated soc...


Relationship between Professional Ethics with Learning and Intentional Organizational Forgetting: Mediating Role of Sharing Knowledge ‎

Background: Today, professional ethics is regarded as one of the variables that can affect other organizational perspectives. The aim of the present study was therefore to examine the correlation of professional ethics with learning and intentional organizational forgetting among staff of sport offices in Isfahan province, considering the mediating role of knowledge sharing. Method: This is an applied...


Learning to Coordinate without Sharing Information

Researchers in the field of Distributed Artificial Intelligence (DAI) have been developing efficient mechanisms to coordinate the activities of multiple autonomous agents. The need for coordination arises because agents have to share resources and expertise required to achieve their goals. Previous work in the area includes using sophisticated information exchange protocols, investigating heuristics ...


Publication date: 2003
